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Abstract 

We study the price dynamics of 65 stocks from the Dow Jones Composite Aver¬ 
age from 1973 until 2014. We show that it is possible to define a Daily Market 
Volatility cr(t) which is directly observable from data. This quantity is usually 
indirectly defined by r(t) = a{t)uj{t) where the r{t) are the daily returns of the 
market index and the uj{t) are i.i.d. random variables with vanishing average 
and unitary variance. The relation r{t) = a{t)u){t) alone is unable to give an 
operative definition of the index volatility, which remains unobservable. On the 
contrary, we show that using the whole information available in the market, the 
index volatility can be operatively defined and detected. 
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It is well know that stock market returns are uncorrelated on lags larger than 
a single day. This is an inescapable consequence of the efficiency of markets. On 
the contrary, absolute returns have memory for very long times, a phenomenon 
known as volatility clustering. These phenomena are very well documented in 
the literature and known as stylized facts [3 mail]- 

Furthermore, there is a large empirical evidence that volatility auto-correlations 
decay hyperbolically isEiiaiais] and there is also a growing evidence that 
volatility signals have a multi-fractal nature [loiiniiiaiisiiiiiiis]. 

Daily historical volatility is an unobservable variable and is usually mea¬ 
sured by the absolute value of daily returns which are instead observable. This 
definition gives only an approximation of the real volatility a(t) which can be 
indirectly defined by r{t) = a{t)uj(t) where r{t) are the daily returns of the 
market index and the ui{t) are i.i.d. random variables with vanishing average 
and unitary variance. The relation r(t) = a{t)uj(t) alone is unable to give an 
operative definition of the index volatility, which remains unobservable. We will 
show that using the whole information available in the market, the index volatil¬ 
ity can be operatively defined and detected, i.e., we will define an observable 
volatility for a market index which exhibits all the statistical features expected 
for this variable. 

The daily returns of single stock (say a) are given by ra{t) = ln[S'„(t)/S'Q(t— 

1)] where Sa{t) is the closing price of stock a at day t. Then, if one wants to 
extract volatility from data one can consider that Va (t) = Ua (t)uja {t) where the 
a;(t) have vanishing averages and unitary variance. Volatility aa(t) can be even¬ 
tually extracted considering the high frequency (intra-day) continuous trading, 
but the problem remains highly unsolved because of the overnight contribution 
to the return ra(t) for which there is not continuous trading. Therefore, the 
best measure of (historical) daily volatility simply remain the absolute returns 




If the aim is to measure the global volatility of a market, we will show that 
things can be different. One can address the problem considering volatility of 
a proper representative index. Nevertheless, if one considers a price-weighted 
index (as Nikkei 225) the main contributions will be artificially given by those 
stocks with a larger price. The problem is circumvented if one considers a 
capitalization-weighted index (as Hang Seng) or an equally-weighted index (as 
S&P 500 EWI). The daily return r{t) of this last index is simply the plain 
average of the returns of its components, i.e. 



( 1 ) 


where N is the number of stocks in the basket and ra{t) = ln(S'a(t)/S'Q(t—1)) 
and Sa{t) is the daily closing price of the stock a at day t. For the other 
two types of index the difference is that the average is weighed by price or by 
capitalization. 

Again, the underlying index daily volatility a(t) is not directly observable 
from daily returns but it is indirectly defined by r{t) = a(t)uj{t). Because of 
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market efRciency, it can be assumed that the u}{t) are independent identically 
distributed random variables with vanishing average and unitary variance. One 
could argue that a{t) is, indeed, observable from high frequency data, but, 
as already mentioned, the problem of the overnight contribution to the daily 
returns remains. 

Therefore, since daily market volatility is not objectively given by the index 
return (only the product is observable), its distribution depends on 

the model chosen for Gaussianity is often assumed as in ARCH-GARCH 
modeling (the leptokurticity of the distribution of returns is, in this case, entirely 
charged to volatility). Nevertheless, one can do other choices for the distribution 
of uj{t) as, for example, the uniform distribution (between —i/S and in order 
that the variance is unitary). 

In this paper we consider N = 65 titles of Dow Jones from 1973 to 2014 so 
that 1 < t < r ~ 10000. Dow Jones as an index is not equally weighted but 
we can construct ourselves an EW Dow Jones index for which the daily returns 
r(t) are simply the plain average of returns of its components as in definition 

0. 

The absolute returns are then given by 


|r(t)| = cr{t)\uj{t)\ 
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which are the absolute returns of the associated equally-weighted index (for 
price-weighted and capitalization-weighted indexes the only difference is that 
some weights must be introduced). 

The core of this paper is the definition of the volatility as 


cr(t) 
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N 

X 
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so that 


uj{t) 


r{t) 

(j{t) 


( 4 ) 


where rit) and a{t) are defined in equation 0 and equation 0. 

Most of the models assume that a{t) and |w(t-|-T)| are uncorrelated for any 
value of the lag r (negative, positive or vanishing) or they assume (as ARCH- 
GARCH) a short range correlation (a correlation only for small values of |t|.) 
Moreover |r(<)| and \ui(t -f r))| should be uncorrelated for any non vanishing r 
as well as |a;(t)| and \u}{t -I- r)|. 

Therefore, the first step is to show that all these properties holds according 
to our definition of volatility. After having computed the correlations = 
C{Lo{t),a{t + t)), Cc,uj = C{a[t),ui{t + t)) and also and C\r\^u:, 

we have plotted them in Fig. [l] As it can be well appreciated the four above 
considered cross-correlations and the auto-correlation substantially vanish. This 
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is better appreciated if compared with the volatility auto-correlation which 
is also plotted and which, on the contrary, exhibits a strong lag dependence and 
it is significantly positive for lags up to 250 working days. 



Figure 1: Auto-correlation of a, |a;| and all the cross-correlations between a, ut and |r|. It can 
be noticed that all except the auto-correlation of a are vanishing. 


Once shown that the variables uj(t) are independent from each other, from 
the absolute returns (2) and from the volatilities (3), we need to show that they 
have vanishing expected value ({uj{t)) = 0) and unitary variance ((a;^(t)) = 1). 
Indeed, we obtain a better result, in fact, the distribution of the uj{t) is uniform 
in the range [—-s/S, ■\/3] (which implies vanishing expected value and unitary 
variance but also implies (|w(<)|) = •\/3/2). This result, which is our second 
step, can be appreciated in Fig. where the empirical distribution or |w|, as 
defined by (4), is plotted. 

We still have a third step, to complete our argument. Having assumed 
complete independence of the uj{t) from the volatilities one forcefully has that 
the auto-correlation C\r\,\r\ only differs for a positive multiplicative constant 
0 < fc < 1 from the auto-correlation Ccr,a at any time lag r > 1. In fact, the 
auto-correlation of the absolute returns is 


C'|r|,|r| (f) 


(Kt + r)||r(t)|)-(|r(t)|}^ 
(r2(t))-(|r(f)|)2 


(5) 


then taking into account that |r(t)| = (T{t)\u}{t)\ and assuming mutual indepen¬ 
dence between the a{t) and the w(t), one has for any t > 1: 


C\r\,\r\{T) = kCa^air) 


( 6 ) 
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Figure 2: Uniform (almost) probability density of the Ital. One has iuP) = 0.924 ~ 1) and 
(leal) = 0.823 ~ V3/2 = 0.866. 


where Ca,aiT) is the volatility auto-correlation 


{a{t + T)a{t)) - (cr(t))^ 

{a^t)) - {a{t)y 


(7) 


and k is the constant 

3{aHt))/A-{a{t)y 


( 8 ) 


where we have used the uniform distribution value (|a;(t)|)^/(w^(t)) = 3/4. 
We compute from sample (o'^(t)) = 0.00008583 and (cr(t)) = 0.008388 so that 
k = 1/2.85. 

Moreover, following the same steps, one can easily compute 


C\r\,aiT) = Ca,\r\iT) = VkCa,a{T) (9) 

where k is the same value (8) previously computed (fc ~ 1/2.85) so that (6) and 
(9) are very strict requirements. It turns out that both relations (6) and (9) 
hold, in fact, the four correlations, rescaled by k (or by Vk), are plotted in Fig. 

where it can be appreciated that they are almost identical for all values of the 
lag T. 

Notice, that the coefficient k is not a fitting parameter, but it is in depen- 
dently derived by market data. The fact that after rescaling the four correlations 
coincide ultimately confirms that it is correct to write r(t) = a{t)uj(t) where w(t) 
and a{t) are the mutually independent variables we have defined. 
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Figure 3: Auto-correlations of a and \r\ and their cross-correlations. It is evident that the 
four correlations are (almost) identical once rescaled. 


In sum, we have defined the volatilities a{t) and, consequently, the variables 
uj{t) so that they (a) are mutually independent, (b) the Luit) are also indepen¬ 
dent from absolute returns, (c) the ui{t) are i.i.d. uniformly distributed with 
vanishing expected value and unitary variance, (d) the auto-correlation of the 
volatility exhibits a strong lag dependence and it is significantly positive for 
lags up to 250 working days, (e) the correct scaling of cross-correlations and 
auto-correlations involving the r{t) = a{t)uj{t) and the a{t) holds. Therefore, 
we have given a definition of daily market volatility which is observable and 
keep all the statistical features expected for this variable. 

We can conclude by saying that while for a single stock it is impossible to 
extract the volatility from return using r{t) = (T{t)uj(t), we have found a simple 
way to do that for an index using all absolute returns of the single stocks entering 
in the index. 
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